What is claimed is: 

1. A method for calculating a local mean number of tasks for each processing element 
(PEr) in a parallel processing system, wherein each processing element (PEr) has a local 
number of tasks associated therewith and wherein r represents the number for a selected 
processing element, the method comprising: 

assigning a value (Er) to said each processing element (PEr); 

summing a total number of tasks present on said parallel processing system and 
said value (Er) for said each processing element (PEr); 

dividing the sum of said total number of tasks present on said parallel processing 
system and said value (Er) for said each processing element (PEr) by a total number 
of processing elements in said parallel processing system; and 

truncating a fractional portion of said divided sum for said each processing 
element. 
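
The steps of claim 1 can be sketched as follows. This is a minimal illustration, not the claimed implementation: the function name `local_mean` and the parameter names `total_tasks`, `e_r`, and `num_pes` are assumptions introduced here, and integer floor division stands in for the truncating step, which gives the same result for non-negative operands. Setting the assigned value equal to r in the example is only one possible choice.

```python
def local_mean(total_tasks: int, e_r: int, num_pes: int) -> int:
    """Local mean number of tasks for one processing element (hypothetical helper)."""
    if num_pes <= 0:
        raise ValueError("num_pes must be positive")
    if total_tasks < 0 or e_r < 0:
        raise ValueError("total_tasks and e_r must be non-negative")
    # Summing step: total number of tasks plus the value E_r assigned to this PE.
    summed = total_tasks + e_r
    # Dividing and truncating steps: floor division drops the fractional portion.
    return summed // num_pes


# Example (assumed figures): 43 tasks on 8 processing elements, with E_r = r.
means = [local_mean(43, r, 8) for r in range(8)]
print(means)       # [5, 5, 5, 5, 5, 6, 6, 6]
print(sum(means))  # 43
```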

2. The method of claim 1 wherein said assigning a value (Er) to said each processing 
element (PEr) comprises setting said value (Er) equal to a number between 0 and (N-1), 
where N represents said total number of processing elements in said parallel processing 
system. 

3. The method of claim 2 wherein said assigning a value (Er) to said each processing 
element (PEr) further comprises giving a unique number to said each value (Er) for said each 
processing element (PEr). 

4. The method of claim 1 wherein said assigning a value (Er) to said each processing 
element (PEr) comprises setting said value (Er) equal to said number for a selected processing 
element (r). 

5. The method of claim 1 wherein said value (Er) controls said truncating step such that 
said total number of tasks for said parallel processing system equals the sum of said local 
mean number of tasks for each processing element (PEr) in said parallel processing system. 

6. The method of claim 1 wherein said local mean number of tasks for each processing 
element (PEr) within said parallel processing system is equal to one of X and (X+1). 
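
The properties recited in claims 5 and 6 can be checked numerically. The sketch below is illustrative only and rests on stated assumptions: each processing element r is given the value r (so the values are unique and lie between 0 and one less than the number of processing elements), and the two permitted local means are read as the truncated overall mean and that value plus one. The helper names `local_means` and `check_properties` are hypothetical.

```python
def local_means(total_tasks: int, num_pes: int) -> list:
    """Local means for all PEs, assuming E_r = r for r in 0..N-1 (hypothetical helper)."""
    return [(total_tasks + r) // num_pes for r in range(num_pes)]


def check_properties(total_tasks: int, num_pes: int) -> bool:
    """Return True if the local means satisfy both properties for this input."""
    means = local_means(total_tasks, num_pes)
    base = total_tasks // num_pes  # truncated overall mean (assumed reading)
    # Claim 5: the local means add up to the total number of tasks.
    sums_to_total = sum(means) == total_tasks
    # Claim 6: every local mean takes one of two adjacent values.
    two_values = all(m in (base, base + 1) for m in means)
    return sums_to_total and two_values


# Exhaustive check over a small, arbitrarily chosen range of system sizes and loads.
all_ok = all(check_properties(v, n) for n in range(1, 17) for v in range(200))
print(all_ok)  # True
```

The sum property follows from the identity that the floor of (V + r)/N, summed over r from 0 to N-1, equals V for any non-negative integer V.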

7. The method of claim 1 wherein said assigning step, said summing step, said dividing 
step, and said truncating step are completed on a portion of said parallel processing system. 

8. The method of claim 1 wherein said assigning step, said summing step, said dividing 
step, and said truncating step are completed on a line of said processing elements within said 
parallel processing system. 

9. The method of claim 1 wherein said assigning step, said summing step, said dividing 
step, and said truncating step are completed on a loop of said processing elements within said 
parallel processing system. 

10. The method of claim 1 wherein said assigning step, said summing step, said dividing 
step, and said truncating step are completed on an array of said processing elements within 
said parallel processing system. 

11. The method of claim 1 wherein said assigning step, said summing step, said dividing 
step, and said truncating step are completed on an array of two or more interconnected 
processing elements within said parallel processing system. 

12. A memory device carrying a set of instructions which, when executed, perform a 
method comprising: 

assigning a value (Er) to said each processing element (PEr); 

summing a total number of tasks present on said parallel processing system and 
said value (Er) for said each processing element (PEr); 

dividing the sum of said total number of tasks present on said parallel processing 
system and said value (Er) for said each processing element (PEr) by a total number 
of processing elements in said parallel processing system; and 

truncating a fractional portion of said divided sum for said each processing 
element. 